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(54) Title: SUBTILIS IN MUTANTS 



(57) Abstract 

Novel carbonyl hydro- 
lase mutants derived from the 
DNA sequences of naturally- 
occurring or recombinant 
non-human carbonyl hydro- 
lases are disclosed. The mut- 
ant carbonyl hydrolases, in 
general, are obtained by in 
vitro modification of a prec- 
ursor DNA sequence encod- 
ing the naturally-occurring or 
recombinant carbonyl hydro- 
lase to generate the substitu- 
tion of one or more amino ac- 
id residues in the amino acid 
sequence of a precursor carb- 
onyl hydrolase. Such mutant 
carbonyl hydrolases have 
porperties which are different 
from those of the precursor 
hydrolase and are especially 
useful in detergent formula- 
tions. The substituted amino 
acid residues correspond to 
position +123 and/or +274 
in Bacillus amyloliquefaciens 
subtilisin. 
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SUBTILISIN MUTANTS 
Field of the Invention 

The present invention relates to novel carbonyl 
hydrolase mutants having an amino acid sequence wherein 
5 one or more amino acid residues of- a precursor carbonyl 
hydrolase, specifically those at positions corresponding 
to residues +123 and/or +274 in Bacillus 
amvloliquefaciens subtilisin, have been substituted with 
a different amino acid. Such mutant carbonyl 

10 hydrolases, in general, are obtained by in vitro 
modification of a precursor DNA sequence encoding a 
naturally-occurring or recombinant carbonyl hydrolase 
to encode the substitution of one or both of these amino 
acid residues in a precursor amino acid sequence alone 

15 or in combination with other substitution, insertion or 
deletion in the precursor amino acid sequence. 

Background of the Invention 

Serine proteases are a subgroup of carbonyl hydrolase. 
They comprise a diverse class of enzymes having a wide 

20 range of specificities and biological functions. 
Stroud, R. M. (1974), Sci. Amer. , 131, 74-88. Despite 
their functional diversity, the catalytic machinery of 
serine proteases has been approached by at least two 
genetically distinct families of enzymes: the 

25 subtil is ins and the mammalian chymotrypsin related and 
homologous bacterial serine proteases (e.g. , trypsin and 
£. qresius trypsin) . These two families of serine 
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proteases show remarkably similar mechanisms of 
catalysis. Kraut, J. (1977), Ann. Rev. Biochem. . 16, 
331-358, Furthermore, although the primary structure 
is unrelated, the tertiary structure of these two enzyme 
5 families bring together a conserved catalytic triad of 
amino acids consisting of serine, histidine and 
aspartate* 

Subtilisin is a serine endoprotease (MW 27,500) which 
is secreted in large amounts from a wide variety of 

10 Bacillus species and other microorganisms. The protein 
sequence of subtilisin has been determined from at least 
four different species of Bacillus . Markland, F.S., et 
al. (1983) , Honne-Sevler's Z. P hvsiol. Chem. . £M# 1537- 
1540. The three-dimensional crystallographic structure 

15 of Bacillus amvloliouefaciens subtilisin to 2.5A 
resolution has also been reported, Wright, C.S. , et al. 
(1969), Nature . 221 . 235-242; Drenth, J., et al. (1972), 
Eur. J. Biochem. . 26 . 177-181. These studies indicate 
that although subtilisin is genetically unrelated to the 

20 mammalian serine proteases, it has a similar active site 
structure. The x-ray crystal structures of subtilisin 
containing covalently bound peptide inhibitors 
(Robertus, J.D., et al. (1972), Biochemistry . H, 2439- 
2449), or product complexes (Robertus, J.D., et al. 

25 (1976), J. Biol. Chem. . 251 » 1097-1103), have also 
provided information regarding the active site and 
putative substrate binding cleft of subtilisin. In 
addition, a large number of kinetic and chemical 
modification studies have been reported for subtilisin 

30 (Philipp, K., et al. (1983), Mol. Cell. Biochem.. £1, 
5-32; S vends en, B. (1976), Carlsbera Res. Comm.. £1, 
237-291; Markland, F.S. ££.) as well as at least one 
report wherein the side chain of methione at residue 222 
of subtilisin was converted by hydrogen peroxide to 

35 methionine-sulfoxide (Stauffer, D.C., et al. (1965), 
Biol. Chem. . 244 . 5333-5338) and the side chain of 
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serine at residue 221 converted to cysteine by chemical 
modification (Polgar, et al. (1981), Biochimica et 
Biophysics Acta, ££7, 351-354.) 

U.S. Patent No. 4,760,025 and EPO Publication 
5 Mo. 0 130 756 published January 9, 1985 each disclose 
the modification of subtil isin amino acid residues 
corresponding to positions in B&ciUHS amylPllqyef frcien 
subtilisin tyrosine -1, aspartate +32, asparagine +155, 
tyrosine +104, methionine +222, glycine +166, histidine 

10 +64, glycine +169, phenylalanine +189, serine +33, 
serine +221, tyrosine +217, glutamate +156 and alanine 
+152. EPO Publication No. 0 251 446 published 
January 7, 1988 discloses other amino acid residues in 
ppciUus amvloliauefaciens subtilisin and their 

15 equivalents which may be modified by way of 
substitution, insertion or deletion and which may be 
combined with modifications to the residues identified 
in U.S. Patent No. 4,760,025 to form useful subtilisin 
mutants. The particular residues identified herein, 

20 however, are not identified in these references. 

Similarly, PCT Publication No. W0 89/09819 and 
W0 89/09830 each published October 19, 1989, disclose 
subtilisin enzymes made by mutating a nucleotide 
sequence coding for a subtilisin. Numerous amino acid 
25 residues are identified in each of these publications 
which may be so modified. However, as with the 
previously identified references, neither identifies the 
residues of the present invention. 

Accordingly, it is an object herein to provide carbonyl 
30 hydrolase mutants containing the substitution of amino 
acid residues in a precursor carbonyl hydrolase 
corresponding to positions +123 and/or +274 in Bacillus 
amvloliauefaciens subtilisin. Such mutants generally 
have at least one property which is different from the 



WO 91/06637 PCT/US90/06084 

-4- 

same property of the carbonyl hydrolase precursor from 
which the amino acid of said mutant is derived. 

It is further object to provide DNA sequences encoding 
such carbonyl hydrolase mutants as well as expression 
5 vectors containing such mutant DNA sequences. 

Still further, another object of the invention is to 
provide host cells transformed with such vectors as well 
as host cells which are capable of expressing such DNA 
to produce carbonyl hydrolase mutants either 
10 intracellular^ or extracellularly. 

yiinmarv of the Invention 

The invention includes non-naturally occurring carbonyl 
hydrolase mutants having a different proteolytic 
activity, stability, and/or performance characteristic 

15 as 'compared to the precursor carbonyl hydrolase from 
which the amino acid sequence of the mutant is derived. 
The precursor carbonyl hydrolase may be a naturally- 
occurring carbonyl hydrolase or recombinant hydrolase. 
Specifically, such carbonyl hydrolase mutants have an 

20 amino acid sequence, not found in nature, which is 
derived by replacement of one or more amino acid 
residues of a precursor carbonyl hydrolase with one or 
more different amino acids. The one or more amino acid 
residues of the precursor enzyme correspond to positions 

25 Asn +123 and/or Ala +274 of Bacillus jWVlPUqygfggiens 
subtilisin or equivalent amino acid residues in other 
carbonyl hydrolases or subtilisins. 

The invention also includes mutant DNA sequences 
encoding such carbonyl hydrolase or subtilisin mutants. 

30 These mutant DNA sequences are derived from a precursor 
DNA sequence which encodes a naturally-occurring or 
recombinant precursor enzyme. The mutant DNA sequences 
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are derived by modifying the precursor DNA sequence to 
encode the substitution of one or more specific amino 
acid residues encoded by the precursor DNA sequence 
corresponding to position +123 and/or +274 in Bacillus 
5 flfflvloliouefaciens. These recombinant DNA sequences 
encode carbonyl hydrolase mutants having a novel amino 
acid sequence and, in general, at least one property 
which is substantially different from the same property 
of the enzyme encoded by the precursor carbonyl 
10 hydrolase DNA sequence. Such properties include 
proteolytic activity, stability and/or enhanced 
performance characteristics. 



The invention also includes procaryotic and eucaryotic 
subtilisins with a different amino acid residue such as 
15 serine, at positions equivalent to Asn +123 in Bacillus 
amvloliouef aniens subtilisin and to subtilisin with 
different amino acid residues at positions equivalent 
to position +274 in Bacillus amvloliouefari 
subtilisin. 

20 Further, the invention includes expression vectors 
containing such mutant carbonyl hydrolase DNA sequences 
as well as host cells transformed with such vectors 
which are capable of producing such mutants. The 
invention also relates to detergent compositions 

25 comprising the carbonyl hydrolase mutants of the 
invention. 



Brief Description of the n^-ir^ 

Fig. 1 depicts the DNA and amino acid sequence for 
PaciUys amvloliguef aff i ?nf subtilisin and a partial 
30 restriction map of this gene. 
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Pig. 2 depicts the conserved amino acid residues among 
subtilisins from Bacillus aTwIoUguefacjens, Bacillus 
pubtilis varI166 and Bacillus lichenifprrols 
(carlsbergensis) . 

5 Figs. 3A and 3B depict the amino acid sequence of 
subtilisin from Bacillus ninvlol Iffuefacjens, pacillu . s 
subtilis varI168 and Bacillus } i,ch.eniforTOis. 

Fig. 4 depicts the amino acid sequence of three 
subtilisins. The top line represents the amino acid 

10 sequence of subtilisin from BacjUys afflVl nl 1 qu.e£acl ens 
subtilisin (also sometimes referred to as subtilisin 
BPN' ) . The second line depicts the amino acid sequence 
of subtilisin from Bacillus leniUS (subtilisin 309 in 
PCT Publication No. WO 89/06276). The bottom line 

15 represents the amino acid sequence of a preferred 
embodiment of the invention designated GG-RYSA. The 
symbol * denotes the absence of specific amino acid 
residues as compared to subtilisin BPN 1 . 

Fig. 5 depicts the construction of plasmid pGG A274. 

20 Fig. 6 depicts the construction of pGG-KVNA which is an 
intermediate to plasmid pGG-RYSA. 

Fig. 7 depicts the oligonucleotide-duplex method used 
to construct a synthetic Bacillus lenfrus subtilisin 
gene. 

25 Fig. 8 depicts the strategy for constructing a synthetic 
gene encoding Bacillus lent-us subtilisin. 

Fig. 9 depicts the cassette used to make substitutions 
in the DNA at codon position +123 by cassette 
mutagenesis. XXX represents the codon modified to 
30 encode the amino acid substitutions at position +123. 
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Fig. 10 depicts the DNA and amino acid sequence of a 
preferred embodiment of the invention wherein the DNA 
sequence is a synthetic DNA. The DNA in this Figure has 
been modified to encode arginine at position 27, serine 
5 at position 78, tyrosine at position 104, serine at 
position 123 and alanine at position 274. 

Detailed Description of the Invention 
It has been discovered that jLn vitro mutations in the 
carbonyl hydrolase subtil isin at an amino acid residue 
10 equivalent to +123 in Bacillus amvloliouef aciens 
subtilisin produces subtilisin mutants exhibiting 
altered proteolytic activity over precursor subtilisins. 

It has also been discovered that in vitro mutation at 
residues equivalent to +274 in Bacillus 
15 amvloliguef aciens subtilisin produce subtilisin mutants 
exhibiting altered stability, e.g. modified 
autoproteolytic stability . In some instances, these 
latter mutants also exhibit enhanced performance when 
used in detergent compositions. 

20 Carbonyl hydrolases are enzymes which hydrolyze 
compounds containing 

O 
II 

c-x 

25 bonds in which X is oxygen or nitrogen. They include 
naturally-occurring carbonyl hydrolases and recombinant 
carbonyl hydrolases. Naturally-occurring carbonyl 
hydrolases principally include hydrolases, e.g. peptide 
hydrolases, such as subtilisins or meta llopro tease s . 

30 Peptide hydrolases include a-aminoacylpeptide hydrolase, 
peptidylamino acid hydrolase, acylamino hydrolase, 
serine carboxypeptidase, metallocarboxypeptidase, thiol 
proteinase, carboxylproteinase and metalloproteinase. 
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Serine, metallo, thiol and acid proteases are included, 
as veil as endo and exo-proteases . 

"Recombinant carbonyl hydrolase" refers to a. carbonyl 
hydrolase in which the DNA sequence encoding the 
5 naturally-occurring carbonyl hydrolase is modified to 
produce a mutant DNA sequence which encodes the 
substitution, insertion or deletion of one or more amino ? 
acids in the carbonyl hydrolase amino acid sequence. 
Suitable modification methods are disclosed herein, in 
10 EPO Publication No. 0 130 756 published January 9, 1985 
and EPO Publication No. 0 251 446 published January 7, 
1988. 

Subtilisins are bacterial or fungal carbonyl hydrolases 
which generally act to cleave peptide bonds of proteins 

15 or peptides. As used herein, "subtilisin" means a 
naturally-occurring subtilisin or a recombinant 
subtilisin. A series of naturally-occurring subtilisins 
is known to be produced and often secreted by various 
microbial species. Amino acid sequences of the members 

20 of this series are not entirely homologous. However, 
the subtilisins in this series exhibit the same or 
similar type of proteolytic activity. This class of 
serine proteases shares a common amino acid sequence 
defining a catalytic triad which distinguishes them from 

25 the chymotrypsin related class of serine proteases. The 
subtilisins and chymotrypsin related serine proteases 
both have a catalytic triad comprising aspartate, 
histidine and serine. In the subtilisin related 
proteases the relative order of these amino acids, * 

30 reading from the amino to carboxy terminus is aspartate- 

histidine-serine. In the chymotrypsin related proteases > 
the relative order, however is histidine-aspartate- 
serine. Thus, subtilisin herein refers to a serine 
protease having the catalytic triad of subtilisin 

35 related proteases. Examples include the subtilisins 
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identified in Fig. 3 herein and as described in PCT 
Publication WO 89/06276 and EPO Publication 
NO. 0 283 075. 

"Recombinant subtilisin" refers to a subtilisin in which 
5 the DNA sequence encoding the subtilisin is modified to 
produce a mutant DNA sequence which encodes the 
substitution, deletion or insertion of one or more amino 
acids in the naturally-occurring subtilisin amino acid 
sequence. Suitable methods to produce such modification 
10 and which may be combined with those disclosed herein 
include those disclosed in EPO Publication 
Kos. 0 130 756 and 0 251 44 6 and PCT Publication Nos. 
WO 89/06279, W0 89/09830 and W0 89/09819. 

"Non-human carbonyl hydrolases" and the DNA encoding 

15 then may be obtained from many procaryotic and 
eucaryotic organisms. Suitable examples of procaryotic 
organisms include gram negative organisms such as £. 
coli or Pseudomonas and gram positive bacteria such as 
Micrococcus or Bacillus. Examples of eucaryotic 

20 organisms from which carbonyl hydrolase and their genes 
may be obtained include yeast such as Saccaromvcees 
cerevis;iae , fungi such as Aspergillus S p. , and non-human 
mammalian sources such as, for example, bovine sp. from 
which the gene encoding the carbonyl hydrolase chymosin 

25 can be obtained. As with subtilisins, a series of 
carbonyl hydrolases can be obtained from various related 
species which have amino acid sequences which are not 
entirely homologous between the members of that series 
but which nevertheless exhibit the same or similar type 

30 of biological activity. Thus, non-human carbonyl 
hydrolase as used herein has a functional definition 
which refers to carbonyl hydrolases which are 
associated, directly or indirectly, with procaryotic and 
eucaryotic sources. 
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A "carbonyl hydrolase mutant" has an amino acid sequence 
which is derived from the amino acid sequence of a 
"precursor carbonyl hydrolase" . The precursor carbonyl 
hydrolases include naturally-occurring carbonyl 
5 hydrolases and recombinant carbonyl hydrolases. The 
amino acid sequence of the carbonyl hydrolase mutant is 
"derived" from the precursor hydrolase amino acid 
sequence by the substitution, deletion or insertion of 
one or more amino acids of the precursor amino acid 

10 sequence. Such modification is- of the "precursor DNA 
sequence" which encodes the amino acid sequence of the 
precursor carbonyl hydrolase rather than manipulation 
of the precursor carbonyl hydrolase enzyme e£*r ££• 
Suitable methods for such manipulation of the precursor 

15 DNA sequence include methods disclosed herein and in EPO 
Publication Nos. 0 130 756 and 0 251 446. 

Specific residue's corresponding to positions +123 and 
+274 of Bacillus amvloliou ef aciens subtilisin are 
identified herein for substitution. These amino acid 

20 position numbers refer to those assigned to the mature 
Bacillus amvlol iouef aciens subtilisin sequence presented 
in Fig, 1. The invention, however, is not limited to 
the mutation of this particular subtilisin but extends 
to precursor carbonyl hydrolases containing amino acid 

25 residues at positions which are "equivalent" to the 
particular identified residues in Bacillus 
amvlol iquef aciens subtilisin. 

A residue (amino acid) of a precursor carbonyl hydrolase 
is equivalent to a residue of Bacillus aravloliouef aciens 

30 subtilisin if it is either homologous (i.e., 
corresponding in position in either primary or tertiary 
structure) or analogous to a specific residue or portion 
of that residue in Bacillus amvloliouef aciens subtilisin 
(i.e., having the same or similar functional capacity 

35 to combine, react, or interact chemically) . 
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In order to establish homology to primary structure, the 
amino acid sequence of a precursor carbonyl hydrolase 
is directly compared to the Bacillus amvlolicruefaciens 
subtilisin primary sequence and particularly to a set 
5 of residues known to be invariant in all subtilisins for 
which sequence is known (Fig, 2). After aligning the 
conserved residues , allowing for necessary insertions 
and deletions in order to maintain alignment (i.e., 
avoiding the elimination of conserved residues through 

10 arbitrary deletion and insertion) , the residues 
equivalent to particular amino acids in the primary 
sequence of p^cUlus amvlolicruefaciens subtilisin are 
defined. Alignment of conserved residues preferably 
should conserve 100% of such residues. However, 

15 alignment of greater than 75% or as little as 50% of 
conserved residues is also adequate to define equivalent 
residues. Conservation of the catalytic triad, 
Asp32/His64/Ser221 should be maintained. 

For example, in Fig. 3 the amino acid sequence of 
20 subtilisin from pacillus amvloliouefaciens. Bacillus 
pubtlljis var. 1168 and Bacillus lichenformis 
(carlsbergensis) are aligned to provide the maximum 
amount of homology between amino acid sequences. A 
comparison of these sequences shows that there are a 
25 number of conserved residues contained in each sequence. 
These are the residues identified in Fig. 2. 

These conserved residues thus may be used to define the 
correspbnding equivalent amino acid residues of Bacillus 
amvloliouefaciens subtilisin in other carbonyl 

30 hydrolases such as subtilisin from Bacillus lentus (PCT 
Publication No. W089/06279 published July 13, 1989) and 
the preferred subtilisin mutant herein. These 
particular amino acid sequences are aligned in Fig. 4 
with the seguence of Bacillus amvlolimi efaciens 

35 subtilisin to produce the maximum homology of conserved 
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residues. As can be seen there are a number of 
deletions in the sequence of Bacillus lentus and in the 
preferred subtilisin mutant of the invention as compared 
to BacUlVlP ?MVlPlicpjefaciens subtilisin. Thus, the 
5 equivalent amino acid for Val-165 in Bacillus 
amvloliouefaciens subtilisin in the other subtilisins 
is the particular isoleucine shown beneath Val-165* 

In Fig. 4, the amino acid at position 123 is asparagine 
in Bacillus amvloliouefaciens subtilisin. In BftgiUttP 

10 lentus subtilisin the equivalent residue is the 
particular asparagine shown. In the preferred 
subtilisin mutant of the invention, however, the amino 
acid equivalent to +123 in Bacillus amvloliouefaciens 
subtilisin is an amino acid other than asparagine and 

15 is preferably the serine shown in Fig. 4. Similarly, 
in Fig. 4, the amino acid at position +274 Bacillus 
amvloliouefaciens subtilisin is alanine. As can be 
seen, the equivalent amino acid in Bacillus lentus 
subtilisin is the particular threonine shown in Fig. 4. 

20 In a particular preferred subtilisin mutant of the 
invention, the equivalent amino acid position 274 is 
occupied by the alanine shown in Fig. 4. 

Thus, the positions +123 and +274 are identified by 
primary amino acid sequences in Fig. 4. for the 

25 subtilisin from Bacillus lentus and the preferred 
embodiment of the invention. However, various other 
amino acid residues may be modified which are equivalent 
to specific amino acids in Bacillus amvlol iouef aciens 
subtilisin. Thus, in the preferred embodiment, the 

30 amino acid lysine at position 27 in Bacillus 
amvloliquef aciens subtilisin has an equivalent lysine 
at position 27 in Bacillus lentus subtilisin. As 
indicated in the Examples, the subtilisin comprising one 
of the preferred embodiments of the invention was 

35 derived by modifying a DNA sequence encoding Bacillus 
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lentus subtilisin. Such modifications to the DNA 
included the modification of codons equivalent to 
positions 123 and 274 of Bacillus amvloliouef aciens 
subtilisin. However, two other modifications were made 

5 to the ftacillus lentus amino acid sequence at positions 
equivalent to residues 27 and 104 in Bacillus 
amvlolicruefaciens subtilisin. Thus, as can be seen in 
Fig. 4, the lysine at equivalent residue 27 in BapillV 6 
lentus subtilisin was modified to encode arginine in the 
10 preferred embodiment. Similarly, the valine residue at 
position 104 of Bacillus IgiftW* which is equivalent to 
tyrosine 104 in Bacillus apvlolicruef aciens subtilisin, 
was also modified to encode tyrosine. Thus, the 
preferred embodiment shown in Fig. 4 contains an amino 

15 acid sequence derived from Bacillus jentys subtilisin 
by modifying residues of that subtilisin equivalent to 
positions 27, 104, 123 and 274 of papjllVg 
amvlolioue f aciens subtilisin. 

Equivalent residues may also be defined by determining 

20 homology at the level of tertiary structure for a 
precursor carbonyl hydrolase whose tertiary structure 
has been determined by x-ray crystallography. 
Equivalent residues are defined as those for which the 
atomic coordinates of two or more of the main chain 

25 atoms of a particular amino acid residue of the 
precursor carbonyl hydrolase and Bflcillvs 
amvlol iauef aciens subtilisin (N on N, CA on CA, C on C, 
and 0 on O) are within 0.13nm and preferably O.lnm after 
alignment. Alignment is achieved after the best model 

30 has been oriented and positioned to give the maximum 
overlap of atomic coordinates of non-hydrogen protein 
atoms of the carbonyl hydrolase in question to the 
Bacillus amvlol ioue f aciens subtilisin. The best model 
is the crystallographic model giving the lowest R factor 

35 for experimental diffraction data at the highest 
resolution available. 
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Equivalent residues which are functionally analogous to 
a specific residue of Bacillus amvloliouefacieTiB 
subtilisin are defined as those amino acids of the 
precursor carbonyl hydrolases which may adopt a 
5 conformation such that they either alter, modify or 
contribute to protein structure, substrate binding or 
catalysis in a manner defined and attributed to a 
specific residue of the lacllias amvloliouefaeiens 
subtilisin. Further, they are those residues of the 

10 precursor carbonyl hydrolase (for which a tertiary 
structure has been obtained by x-ray crystallography) , 
which occupy an analogous position to the extent that 
although the main chain atoms of the given residue may 
not satisfy the criteria of equivalence on the basis of 

15 occupying a homologous position, the atomic coordinates 
of at least two of the side chain atoms of the residue 
lie with 0.l3nm of the corresponding side chain atoms 
of Bacillus amvlolicm efacienfi subtilisin. The 
coordinates of the three dimensional structure of 

20 Bacillus amvlolicmefaeiens subtilisin are set forth in 
EPO Publication No. o 251 446 and can be used as 
outlined above to determine equivalent residues on the 
level of tertiary structure. 

Some of the residues identified for substitution, 
25 insertion or deletion are conserved residues whereas 
others are not. In the case of residues which are not 
conserved, the replacement of one or more amino acids 
is limited to substitutions which produce a mutant which 
has an amino acid sequence that does not correspond to 
30 one found in nature, in the case of conserved residues, 
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such replacements should not result in a naturally- 
occurring sequence. The carbonyl hydrolase mutants of 
the present invention include the mature forms of 
carbonyl hydrolase mutants as well as the pro- and 
5 prepro-forms of such hydrolase mutants. The prepro- 
forms are the preferred construction since this 
facilitates the expression, secretion and maturation of 
the carbonyl hydrolase mutants. 

"Prosequence" refers to a sequence of amino acids bound 
10 to the N-terminal portion of the mature form of a 
carbonyl hydrolase which when removed results in the 
appearance of the "mature" form of the carbonyl 
hydrolase. Many proteolytic enzymes are found in nature 
as translational proenzyme products and, in the absence 
15 of post-translational processing, are expressed in this 
fashion. A preferred prosequence for producing carbonyl 
hydrolase mutants, specifically subtilisin mutants, is 
the putative prosequence of Bacillus amyloliQuefaciens 
subtilisin although other subtilisin prosequences may 
20 be used. In the Examples, the putative pro sequence 
from the subtilisin from Bacillus lentus (ATCC 21536) 
was used. 

A "signal sequence" or "presequence" refers to any 
sequence of amino acids bound to the N-terminal portion 

25 of a carbonyl hydrolase or to the N-terminal portion of 
a prohydrolase which may participate in the secretion 
of the mature or pro forms of the hydrolase. This 
definition of signal sequence is a functional one, meant 
to include all those amino acid sequences, encoded by 

30 the N-terminal portion of the subtilisin gene or other 
secretable carbonyl hydrolases, which participate in the 
effectuation of the secretion of subtilisin or other 
carbonyl hydrolases under native conditions. The 
present invention utilizes such sequences to effect the 

35 secretion of the carbonyl hydrolase mutants as defined 
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herein • A preferred signal sequence used in the 
Examples comprises the first seven amino acid residues 
of the signal sequence from Bacillus subtil is subtilisin 
fused to the remainder of the signal sequence of the 
5 subtilisin from Bacillus lentus (ATCC 21536) . 

A "prepro" form of a carbonyl hydrolase mutant consists 
of the mature form of the hydrolase having a prosequence 
operably linked to the amino-terminus of the hydrolase 
and a "pre" or "signal" sequence, operably linked to the 
10 amino terminus of the prosequence. 

"Expression vector" refers to a DNA construct containing 
a DNA sequence which is operably linked to a suitable 
control sequence capable of effecting the expression of 
said DNA in a suitable host. Such control sequences 

15 include a promoter to effect transcription, an optional 
operator sequence to control such transcription, a 
sequence encoding suitable mRNA ribosome binding sites, 
and sequences which control termination of transcription 
and translation. The vector may be a plasmid, a phage 

20 particle, or simply a potential genomic insert. Once 
transformed into a suitable host, the vector may 
replicate and function independently of the host genome, 
or may, in some instances, integrate into the genome 
itself. In the present specification, "plasmid" and 

25 "vector" are sometimes used interchangeably as the 
plasmid is the most commonly used form of vector at 
present. However, the invention is intended to include 
such other forms of expression vectors which serve 
equivalent functions and which are, or become, known in 

30 the art. 

The "host cells" used in the present invention generally 
are procaryotic or eucaryotic hosts which preferably 
have been manipulated by the methods disclosed in EPO 
Publication No. 0 130 756 to render them incapable of 
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secreting enzymatically active endoprotease. A 
preferred host cell for expressing subtilisin is the 
Bacillus strain BG2036 which is deficient in 
enzymatically active neutral protease and alkaline 
5 protease (subtilisin). The construction of strain 
BG2036 is described in detail in EPO Publication 
No. 0 130 756 and further described by Yang, M.Y., et 
al. (1984) , J. Bacterid. . 160 , 15-21. Other host cells 
for expressing subtilisin include Bacillus subtilis 1168 
10 (EPO Publication No. 0 130 756). 

Host cells are transformed or trans feet ed with vectors 
constructed using recombinant DNA techniques. Such 
transformed host cells are capable of either replicating 
vectors* encoding the carbonyl hydrolase mutants or 
15 expressing the desired carbonyl hydrolase mutant. In 
the case of vectors which encode the pre or prepro form 
of the carbonyl hydrolase mutant, such mutants, when 
expressed, are typically secreted from the host cell 
into the host cell medium. 

20 "Operably linked" when describing the relationship 
between two DNA regions simply means that they are 
functionally related to each other. For example, a 
presequence is operably linked to a peptide if it 
functions as a signal sequence, participating in the 

25 secretion of the mature form of the protein most 
probably involving cleavage of the signal sequence. A 
promoter is operably linked to a coding sequence if it 
controls the transcription of the sequence? a ribosome 
binding site is operably linked to a coding sequence if 

30 it is positioned so as to permit translation. 

The genes encoding the naturally-occurring precursor 
carbonyl hydrolase may be obtained in accord with the 
general methods described in EPO Publication Nos. 
0 130 756 and 0 251 446. As can be seen from the 
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examples disclosed therein, the methods generally 
comprise synthesizing labelled probes having putative 
sequences encoding regions of the hydrolase of interest, 
preparing genomic libraries from organisms expressing 
5 the hydrolase, and screening the libraries for the gene 
of interest by hybridization to the probes. Positively 
hybridizing clones are then mapped and sequenced. 

The cloned carbonyl hydrolase is then used to transform 
a host cell in order to express the hydrolase. The 

10 hydrolase gene is then ligated into a high copy number 
plasmid. This plasmid replicates in hosts in the sense 
that it contains the well-known elements necessary for 
plasmid replication: a, promoter operably linked to the 
gene in question (which may be supplied as the gene's 

15 own homologous promotor if it is recognized, i.e., 
transcribed, by the host) , a transcription termination 
and polyadenylation region (necessary for stability of 
the mRNA transcribed by the host from the hydrolase gene 
in certain eucaryotic host cells) which is exogenous or 

20 is supplied by the endogenous terminator region of the 
hydrolase gene and, desirably, a selection gene such as 
an antibiotic resistance gene that enables continuous 
cultural maintenance of plasmid-inf ected host cells by 
growth in antibiotic-containing media. High copy number 

25 plasmids also contain an origin of replication for the 
host, thereby enabling large numbers of plasmids to be 
generated in the cytoplasm without chromosomal 
limitations. However., it is within the scope herein to 
integrate multiple copies of the hydrolase gene into 

30 host genome. This is facilitated by procaryotic and 
eucaryotic organisms which are particularly susceptible 
to homologous recombination. 

Alternatively, a synthetic gene encoding a naturally- 
occurring or mutant precursor carbonyl hydrolase may be 
35 produced. In such an approach, the DNA and/or amino 



WO 91/06637 



19 



PCT/US90/06084 



acid sequence of the precursor hydrolase is determined. 
Multiple, overlapping synthetic single-stranded DNA 
fragments are thereafter synthesized which upon 
hybridization and ligation produce a synthetic DNA 
5 encoding the precursor hydrolase. This approach 
provides several advantages over cloning the natural 
gene in that restriction sites may be interposed 
throughout the DNA without change in the amino acid 
sequence encoded so as to facilitate subsequent 

10 modification to form mutant carbonyl hydrolases. 
Further, the synthetic approach allows for adjusting the 
codon usage in the synthetic gene to conform with the 
codon bias for the particular expression hosts to be 
used. An example of synthetic gene construction is set 

15 forth in the Examples. 

Once the naturally-occurring or synthetic precursor 
carbonyl hydrolase gene has been cloned, a number of 
modifications are undertaken to enhance the use of the 
gene beyond synthesis of the naturally-occurring 
20 precursor carbonyl hydrolase. Such modifications 
include the production of recombinant carbonyl 
hydrolases as disclosed in EPO Publication Nos. 
0 130 756 and 0 251 446 and the production of carbonyl 
hydrolase mutants described herein. 

25 The following cassette mutagenesis method may be used 
to facilitate the construction and identification of the 
carbonyl hydrolase mutants of the present invention 
although other methods including site-directed 
mutagenesis may be used. First, the naturally-occurring 

30 gene encoding the hydrolase is obtained and sequenced 
in whole or in part. Then the sequence is scanned for 
a point at which it is desired to make a mutation 
(deletion, insertion or substitution) of one or more 
amino acids in the encoded enzyme. The sequences 

35 flanking this point are evaluated for the presence of 
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restriction sites for replacing a short segment of the 
gene with an oligonucleotide pool which when expressed 
will encode various mutants . Such restriction sites are 
preferably unique sites within the hydrolase gene so as 
5 to facilitate the replacement of the gene segment. 
However, any convenient restriction site which is not 
overly redundant in the hydrolase gene may be used, 
provided the gene fragments generated by restriction 
digestion can be reassembled in proper sequence. If 
10 restriction sites are not present at locations within 
a convenient distance from the selected point (from 10 
to 15 nucleotides), such sites are generated by 
substituting nucleotides in the gene in such a fashion 
that neither the reading frame nor the amino acids 
15 encoded are changed in the final construction. Mutation 
of the gene in order to change its sequence to conform 
to the desired sequence is accomplished by M13 primer 
extension in accord with generally known methods. The 
task of locating suitable flanking regions and 
20 evaluating the needed changes to arrive at two 
convenient restriction site sequences is made routine 
by the redundancy of the genetic code, a restriction 
enzyme map of the gene and the large number of different 
restriction enzymes. Note that if a convenient flanking 
25 restriction site is available, the above method need be 
used only in connection with the flanking region which 
does not contain a site. 

Once the naturally-occurring DNA or synthetic DNA is 
cloned, the restriction sites flanking the positions to 

30 be mutated are digested with the cognate restriction 
enzymes and a plurality of end termini-complementary 
oligonucleotide cassettes are ligated into the gene. 
The mutagenesis is enormously simplified by this method 
because all of the oligonucleotides can be synthesized 

35 so as to have the same restriction sites, and no 
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synthetic linkers are necessary to create the 
restriction sites. 

As used herein, proteolytic activity is defined as the 
rate of hydrolysis of peptide bonds per milligram of 
5 active enzyme. Many well known procedures exist for 
measuring proteolytic activity (K. M. Kalisz, "Microbial 
Proteinases" , Advances in Biochemical Engineering/ 
Biotechnology. A. Fiechter ed. , 1988). 

In one aspect of the invention, the objective is to 
10 secure a mutant carbonyl hydrolase having a greater 
(numerically large) proteolytic activity as compared to 
the precursor carbonyl hydrolase, thereby enabling the 
use of the enzyme to more efficiently act on a target 
substrate. Specific amino acids useful to obtain such 
15 results in subtil is in-type carbonyl hydrolases at 
residues equivalent to +123 in Bacillus 
flPYlP'Uquef&cieps subtilisin are presented in the 
Examples. In some instances, lower proteolytic activity 
may be desirable. In such cases a decrease in 
20 proteolytic activity can be produced by substituting the 
amino acids identified in the examples at residues 
equivalent to +123 in Bacillus amvloliouefaciens 
subtilisin. 

For precursor subtilisins wherein serine is not the 
25 residue at the position equivalent to +123 in Bacillus 
amvloliguefaciens the greatest proteolytic activity is 
obtained when serine is substituted in the precursor at 
position +123. Further, no naturally-occurring Bacillus 
subtilisin is known to exist which contains serine at 
30 a position equivalent to +123 in Bacillus 
amvlol i mief aciens subtilisin. Based on the discovery 
that serine at this position enhances proteolytic 
activity, one skilled in the art can screen naturally- 
occurring Bacillus subtilisin to identify and clone a 
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natural mutant containing serine at this position. Such 
natural Bacillus subtilisin mutants are within the scope 
of the invention. 

Where the carbonyl hydrolase is from other than Bacillus 
5 and a serine is present at +123 in the precursor enzyme 
the substitution can be one that decreases proteolytic 
activity. This would be useful, for example, where the 
synthetic activity of the carbonyl hydrolases is desired 
(as for synthesizing peptides) . One may wish to 
10 decrease this proteolytic activity which is capable of 
destroying the product of such synthesis. 



In another aspect of the invention, it has been 
determined that residues equivalent to +274 in Bacillps 
amyloliquefaciens subtilisin are important in modulating 

15 the overall performance characteristics of the enzyme 
in detergent compositions. Thus, as set forth in the 
Examples, the threonine in Bacillus lentus subtilisin 
at equivalent position +274 can be mutated to alanine 
in the preferred embodiment to produce enhanced 

20 performance of the mutant enzyme. As also disclosed in 
the Examples, substitution of this residue with an amino 
acid other than threonine, e.g. leucine, serine, valine 
and alanine results in a decrease in the stability of 
the mutant. Such decrease in stability is believed to 

25 be the result of autocatalytic degradation of the 
mutant. Thus, modifications of residues equivalent to 
+274 in Bacillus subtilisin are capable of enhancing the 
overall performance of the enzyme in a detergent 
composition and modulating the overall stability of the 

30 enzyme. In this aspect of the invention, the objective 
is to secure a mutant carbonyl hydrolase having enhanced 
performance when used in a detergent composition as 
compared to the precursor carbonyl hydrolase. As used 
herein, enhanced performance in a detergent is defined 

35 as increased cleaning of certain enzyme sensitive stains 



WO 91/06637 PCT/US90/06084 

-23- 

such as grass or blood. This cleaning is determined by 
visual evaluation after a standard wash cycle. 

A preferred embodiment of the invention is set forth in 
the Examples wherein the lysine at position 27 is 
5 substituted with arginine, the valine at position 104 
is substituted with tyrosine, the asparagine at position 
123 substituted with serine and the threonine at residue 
274 is substituted with alanine in Bacillus lentus 
subtilisin. Although the stability of this enzyme is 

10 somewhat reduced as compared to the precursor Bacillus 
lentus subtilisin, the performance level of this enzyme 
in a detergent composition is substantially enhanced 
such that the same performance of this Bacillus lentus 
subtilisin mutant is obtained as compared to the 

15 unmodified Bacillus lentus subtilisin when using 
approximately one-half the amount of enzyme. 

Based on the results obtained with this and other mutant 
subtil isins, it is apparent that residues in carbonyl 
hydrolases equivalent to positions +123 and +274 in 
20 Bacillus amvlolicr uef aciens are important to the 
proteolytic activity, performance and/or stability of 
these enzymes. 

Many of the carbonyl hydrolase mutants of the invention, 
especially subtilisin, are useful in formulating various 

25 detergent compositions. A number of known compounds are 
suitable surfactants useful in compositions comprising 
the carbonyl hydrolase mutants of the invention. These 
include nonionic, anionic, cationic, anionic, or 
zwitterionic detergents, as disclosed in U.S. 4,404,128 

30 to Barry J. Anderson and U.S. 4,261,868 to Jiri Flora, 
et al. The art is familiar with the different 
formulations which can be used as cleaning compositions. 



WO 91/06637 



-24- 



PCI7US90/06084 



Subtilisins of the invention can be formulated into 
known powdered and liquid detergents having pH between 
6.5 and 12.0 at levels of about .01 to about 5% 
preferably .1% to ,05%) by weight. These detergent 
5 cleaning compositions can also include other enzymes 
such as known proteases and amylases, as well as 
builders and stabilizers. 

The addition of subtilisins of the invention to 
conventional cleaning compositions does not create any 

10 special use limitation. In other words, any temperature 
and pH suitable for the detergent is also suitable for 
the present compositions as long as the pH is within the 
above range, and the temperature is below the 
subtilisins of the invention denaturing temperature. In 

15 addition, subtilisins of the invention can be used in 
a cleaning composition without detergents, again either 
alone or in combination with builders and stabilizers. 

The following is presented by way of example and is not 
to be construed as a limitation to the scope of the 
20 claims. 

EXAMPLE 1 

Constructions for Expression of Bacillus lentus 
Subtilisin Gene in Bacillus subtilis 

PlasmidpSAR, Fig. 5, carries a translational fusion via 
25 a common Sau3A restriction site at the seventh/eighth 
signal sequence codon of the subtilisin genes of B. 
subtilis and amvloliouefaciens. As shown in Fig. 5, 
this gene, on an EcoRI-BamHI 2.0 Kb fragment, was 
subcloned into M13mpl9 in order to isolate single- 
30 stranded template DNA to be used for site-directed 
mutagenesis to form pSAR-Q275R. The mutagenesis 
protocol was essentially that of Zoller, M. , et al. 
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( 1983 )» Methods Knzvmol . . i£P_, 468-500, (1) and used a 
synthetic oligonucleotide of the sequence: 

* ** 

5' -C-AAC-GTA-CAG-GCJL^CA^CT-CGC-TAA-AAC-ATA-A-3' 
5 Q275R 

where the asterisks denote changes from the wild-type 
gene sequences and the underline represents an 
introduced Pstl restriction endonuclease site used in 
screening for the particular mutant gene encoding the 

10 Q275R change. These changes were made to (1) convert 
the amino acid at this position to that found in 
Bacillus lentus subtil isin and (2) to allow hookup of 
the terminator in pSAR to the mature coding region of 
Bacillus lentus via a Pst site similarly introduced into 

15 pGG36 from Bacillus lentus (ATCC 21536). 

Plasmid pGG36, Fig. 5, contains a 2.1 kb genomic DNA 
fragment from Bacillus lentus (ATCC 21536) encoding the 
complete subtilisin gene which was cloned by standard 
methods in the shuttle vector pBS42. Band, L. , et al. 
20 (1984), Jm, 1, 17-21. 

The amino acid sequence for this subtilisin is the same 
as that disclosed for subtilisin 309 in PCT Publication 
No. 89/06279. This gene was subcloned into M13 as above 
for site-directed mutagenesis using an oligonucleotide 
25 of the sequence: 

■ * * * 

5' -C-AAT-GCA-GAA-G£Xl££A^£CT-CGC-TAA-TCA-A-3' 

T274A 

in order to 1) introduce a Pstl site at the same 
30 location in this gene corresponding to the site 
introduced into pSAR above and 2) to substitute the 
threonine at position 274 with alanine to form pGG36- 
T274A. 
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The mutant pSAR-Q275R and pGG36-T274A genes were 
individually subcloned back into pBS42 prior to 
Pstl/BamHI digestions, fragment isolation and ligation 
to produce plasmid GG-A274B. amy. term, as shown in 
5 Fig- 5, all by standard methods. 

A synthetic DNA linker was made by annealing 
complimentary single-stranded oligonucleotides of the 
sequences: 

5'-G-ATC-GTC-GCG-TCG-ACC-GCA-CTA-CTC-ATT-TCT-GTT- 
10 GCT-TTT-AGT-TCA-T-3' 

and 

5' -CGA-TGA-ACT-AAA-AGC-AAC-AGA-AAT-GAG-TAG-TGC- 
GGT-CGA-CGC-GAC-3' 

to give the double-stranded DNA fragment #2 shown in 
15 Fig. 6. The recessed left- and right-hand ends of this 
duplex linker are complimentary to the Sau3A end of 
fragment #1 (from pSAR) and the Clal end of fragment #3 
(from pGG-A274 B. amy. term), respectively. These 3 
fragments were combined with fragment 4 from pSAR-Q275R 
20 after restriction endonuclease digestions of plasmids, 
fragment isolation and ligation by standard methods to 
produce plasmid pGG-KVNA. The designation GG-KVNA 
indicates that this subtil isin contains the subtilisin 
encoded by pGG-36 which includes lysine (K) at position 
25 27, valine (V) at position 104, asparagine (H) at 
position 123 and the substitution of threonine at 
position 274 with alanine (A) . 
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EXAMPLE p 

Modification of Pfirc-vvyrfl 

As indicated in Fig. 6, the GG-KVNA gene (2.1 kb EcoRI- 
BamHI fragment) was subcloned into M13 for three 
5 successive rounds of site-directed mutagenesis using 
oligonucleotides having the sequence: 



* * * 

(a) 5' - GT-TCT-GCT-GTA-AGA*Gn»GCT-C TT-CTA-CA T.A(:A.fifrr. v 

K27R 



10 

(b) 5 ' - A - GTA - TTA - GGG -££Xi*££ - GGT • TCA - GGT • TCG - TAC - AGC - TCG -ATT - 3 ' 

V104y 

and 



** 



15 (c) 5' -GGG-AAC-AAT-GGA-ATG-CAC-CTT- GCT-AGC -TTG-ACT-TTA.V 

N123S 



The asterisks denote changes from the wild-type gene 
sequence. The underlines represent, in (a) an 
introduced Xbal site and in (b) and (c) introduced Nhel 
20 sites used to screen for the presence of the linked R27, 
Y104 and S123 mutations, respectively. In addition, in 
(c), the overlined denotes a destroyed SphI site. 
Finally, the 2.1 kb GG-RYSA gene was subcloned back into 
PBS42 for expression in subtil is hosts. 

25 The resultant plasmid was designated pGG-RYSA. This 
designation indicates that four residues were modified 
in the pGG-KVNA plasmid. Lysine (K) at position 27 to 
arginine (R) , valine (V) to tyrosine (Y) at position 104 
and asparagine (N) at position 123 to serine (S) . The 

30 alanine previously substituted at residue 274 was not 
modified in this procedure. 
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The lysine at position 27 was substituted with arginine 
based upon the amino acid sequencing of subtilisin 309. 
As indicated in PCT Publication No. WO89/06279, lysine 
is located at position 27. However, after independently 

5 sequencing this subtilisin protein, the initial data 
indicated that arginine was the residue at position 27. 
In the case of the substitution of tyrosine for valine 
at residue 104, the substitution was made to lower the 
pH activity profile and to increase the performance of 

10 the enzyme based on results previously obtained for 
paeillus amvlol ioruef aciens subtilisin (sometimes 
referred to as BPN 1 ) . The substitution of asparagine 
at position 123 with serine is based on the results 
obtained hereinafter wherein it was determined that the 

15 substitution of serine at position 123 maximized the 
proteolytic activity of the enzyme in a closely related 
mutant. 

EXftMPLP 3 

Construction of Synthetic 
20 pacillus lentus Subtilis in Gene 

DNA encoding the amino acid sequence of ^acillus lentus 
subtilisin was also prepared by constructing a gene 
encoding a synthetic DNA sequence. 

The 2.1 kb Hindlll genomic fragment from plasmid pGG36 
25 was sequenced. The deduced amino acid sequence of the 
mature gene product (GG36 subtilisin) was used to design 
a synthetic mature coding sequence with the following 
properties: (1) In general, the codons most frequently 
found for each amino acid in seven different fL. pvfrtlljs 
30 genes (from a tabulation of codon usages, Table 2 from 

Maruyama, T., et al., (1986), Fuel, ftglfls Rgp., 

Supplement 11 PP- rl51-rl97) were utilized except in the 
cases where alternate codons resulted in conveniently 
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located restriction enzyme recognition sites within the 
gene; (2) Approximately every 40-60 b.p. of the -0.8 
mature coding region, combinations of 2 or 3 
specifically chosen codons were utilized which resulted 
5 in the introduction of fairly evenly spaced, unique 
restriction sites. These sites were chosen to 
facilitate (a) later cassette mutagenesis and screening 
studies and (b) constructions involving more than one 
mutation; (3) A unique Pst I recognition site was 

10 designed to cover codons 272-274 allowing hook up to the 
terminator sequences- of a Bacillus amvloliouefar^Ang 
gene similarly modified over the same three codons and 
substituting threonine at position 274 with alanine; and 
(4) A unique Nrul site was introduced to cover mature 

15 codons residues 9-10 allowing hookup to GG36»s pre-pro 
coding sequence via a short synthetic duplex DNA linker. 
Based on this design, oligonucleotides ("oligos") were 
synthesized such that upon annealing the coding and non- 
coding oligos for a given -60 b.p. coding region, the 

20 resultant duplex DNA fragment would have at it's ends 
single stranded regions complimentary to the end of the 
next duplex fragment of the gene (see, Fig. 7). 

A total of 36 separate oligos (comprising 18 individual 
duplexes) were used in the scheme, as outlined above, 
25 resulting in an -0.8kb duplex synthetic mature coding 
region (Fragment 3 in Fig. 8) . 

Finally, one additional pair or synthetic oligo's was 
synthesized, which upon annealing (to give fragment 2 
in Fig. 8) has an Ncol *ite at it's 5' end 
30 (complimentary to GG36 's Ncol site at mature codons 5-6) 
and an Nrul site at its 3« end (complimentary to the 3's 
5' end of fragment 3). 



The final construction to give a complete expression 
unit consisting of a*, gub£His. promoter and the first 
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seven amino acids of the signal sequence hooked up to 
GG36's sequences encoding the remainder of the signal 
sequence, the complete pro sequence and the first six 
mature amino acids (Fragment 1 from GG-KVNA) , the 
5 synthetic gene encoding mature residues 7-274 (Fragments 

2+3) and the terminator region (including the final 
mature gene codon 279) of Bacillus amy3 01 1 mtt PCiens 

(fragment 4) was done as a four-way ligation as set 

forth in Fig. 6. 

10 Finally, three additional separate mutations were 
introduced into the mature coding region of this full 
length hybrid gene. The first substituted the lysine 
at position 27 with arginine. The second substituted 
the valine at position 104 with tyrosine. The third 

15 substituted the asparagine at position 123 with serine 
The resultant plasmid is designated pBC3-RYSA. The 
following example describes the method used to modify 
position 123 in the synthetic gene. Similar methods 
were used to modify positions 27 and 104 in this 

20 synthetic gene. 

EXAMPLE 4 

f9 n , fi-t!-ruction of Position 123 Mutants 

An Xho I site was introduced over codons 111/112 in the 
synthetic gene from Example 3 by making three 

25 phenotypically silent mutations via site directed 
mutagenesis (primer extension mutagenesis in M13) . The 
resulting plasmid, pX123 (Fig. 9) , was digested with Xho 
I and Ava I and the large vector-containing fragment 
isolated by electroelution from agarose gel. 

30 Complimentary synthetic oligonucleotides were annealed, 
ligated with the pX123 large fragment and transformed 
into £. coli strain HM294. These cassettes encoded, 
individually, all 20 naturally-occurring amino acids at 
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position 123, and in addition contained a silent 
mutation which destroyed a unique Sph I site lying 
between the Xho I and Ava I sites in pX123. Resulting 
plasmids from £. coli transf ormants were screened for 
5 the loss of the unique Sph I site. Positives by 
restriction analysis (i.e., Sph I negatives) were 
sequenced to confirm the presence of the desired 
position 123 mutations subcloned into the shuttle vector 
pBS42 and transformed into Bacillus subtilis BG2036 for 
10 expression. 

EXAMPLE 5 

Activity of Various +123 Mutants 

Proteolytic activity of each of the subtilisin mutants 
encoded by the above modified position +123 mutants was 

15 assayed by mixing 0.04 ml of supernatant from 
centrifuged culture broths with 0.56 ml of 1% w/v casein 
in 0.1M Tris pH8.60. After a 20 minute incubation at 
37 *C, reactions were quenched by precipitation with 10% 
trichloroacetic acid (TCA) . Activity was determined 

20 from the absorbance at a wavelength of 280nm for the 
supernatant after precipitation with 10% TCA. 
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TABLE I 

Relative proteolytic activity of codon 123 
variants normalized to Asn-123 mutant 

% Proteolytic 

5 Codon 123 ftcUYitY 



Ser 


116 


Asn 


100 


Cys 


22 


Gly 


12 


Ala 


9 


Thr 


7 


Gin 


7 


Val 


6 




<5 


He 


<5 


Trp 


<5 


Phe 


<5 


Asp 


<5 


His 


<5 


Leu 


<5 


Met 


<5 


Pro 


<5 


Tyr 


<5 



25 In the process of final confirmation of the DNA sequence 
of the synthetic gene coding for the enzyme BC3-RYSA, 
proline was found to be at position 78 instead of serine 
(the amino acid at this position in Bacillus lentus 
subtilisin) . The initial properties of the position 123 

30 mutations were tested in this enzyme, BC3-RPYA (proline 
at position 78). These results are shown in Table I. 
The amino acid at position 78 was thereafter changed 
back to serine to form the DNA and amino acid sequence 
shown in Fig. 10 by replacing the synthetic DNA duplex 

35 corresponding to that portion of the gene. 
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As can be seen the substitution of Asn with Ser at 
position +123 results in a substantial increase in 
proteolytic activity. The relationship between the 
various subtilisins discussed herein are summarized for 
5 positions 27, 78, 104, 123 and 274 in Table II. 



urn n 

position 







27 


78 


104 


123 


274 




6C36 (genomic) 


Lys(K) 


Ser(S) 


Val(V) 


Asn(N) 


Thr(T) 


10 


Synthetic 

B, J.entus pene 


Lys(K) 


Pro(P) 


Val(V) 


Asn(N) 


Ala (A) 




B . amylolique - 
facins .subtilisin 
(BPN) 


Lys(K) 


Ser(S) 


Tyr(Y) 


Asn(N) 


Ala(A) 


15 


Subtilisin 309 as 
published 


Lys(K) 


Ser(S) 


Val(V) 


Asn(N) 


Thr(T) 




Preferred 
embodiment herein 


Arg(R) 


Ser(S) 


Tyr(Y) 


Ser(S) 


Ala (A) 



EXAMPLE 6 



stability of P osition 274 Mutants 

20 Stability of position 274 mutants in BC3-EPY (arginine 
at position 27 , proline at position 78 , and tyrosine at 
position 104 in Bacillus lentus subtilisin) are shown 
in Table III. Data are percent activity remaining 
following incubation at 37 0 C in 50mM EDTA for 60 

25 minutes. 
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TAPLg in 
Amino Acid at 

Position 274 % Activity 

Leucine 2% 

5 Serine 79* 

Threonine 91% 

Valine 42% 

Alanine 43% 

Mutations at this position clearly effect stability of 
10 the enzyme. Although the alanine mutation was not as 
stable as serine or threonine at this position, this 
enzyme provided superior performance relative to 
p^plllvis lentus subtilisin under the conditions of use 
described. For different applications, other amino 
15 acids at position 274 may be used. 

EXAMPLE 7 
Detergent Comnosit 



A spray-dried phosphate detergent granule of the 
following composition was prepared: 
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Component Weight % 

Sodium C12 

linear alkylbenzene sulfonate 8.45 

Sodium Tallow Alcohol sulfate 4.23 

5 Sodium C14"15 linear alkyl sulfate 4.23 

Sodium Toluene Sulfonate 1-00 

Sodium Tripolyphosphate 5.60 

Sodium pyrophosphate 22.40 

Silicate (1.6 r) 5.50 

10 Sodium Sulfate 29.83 

Sodium polyacrylate (4500 MW) 1.17 

Brightener 0.22 

Sodium Carbonate 12.30 

Polyethylene Glycol (MW 8000) 0.47 

15 C12"13 alcohol 

polyethoxylate (6.5)* 0.50 

Miscellaneous + Water to 100% 

Protease** 0.034 



* Alcohol and monoethoxylate alcohol removed. 
20 **mg active enzyme/g (2.0 mg active enzyme/g stock) 

A 0.1 weight percent solution of this composition in 
water had a pH of 10.0. The composition with subtilisin 
mutant of the invention (Fig. 7) provided superior 
cleaning of enzyme-sensitive stains, when compared to 
25 Bacillus lentus at 0.068 mg active enzyme/g product, in 
a 95T (35*C) wash at 6 grains per gallon (gpg) hardness 
(3:1 Ca/Mg). 

Throughout this application reference is made to various 
amino acids by way of common one-and three-letter codes. 
30 Such codes are identified in Proteins: structures and 
Molecular Proteases , Thomas E. Creighton, eds. W.N. 
Freeman, N.Y., H.Y. (1983), p. 3. 
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Although the preferred form of the invention has been 
described above, it will be obvious to those skilled in 
the art to which the invention pertains, that, after 
understanding the invention as a whole, various changes 
5 and equivalent modifications may be made without 
departing from the scope of the invention as defined by 
the appended claims. 

All publications are expressly incorporated herein by 
reference. 
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1. A carbonyl hydrolase mutant having an amino acid 
sequence not found in nature which is derived form a 
precursor carbonyl hydrolase by substituting a different 

5 amino acid for the amino acid residue at a position in 
said precursor equivalent to +123 or +274 in Bacillus 
amvloliouefaciens subtilisin. 

2. The carbonyl hydrolase mutant of Claim 1 wherein 
said precursor carbonyl hydrolase is a subtilisin. 

10 3. The subtilisin mutant according to Claim 2 

wherein said substitution is at a position equivalent 
to +123. 

4. The subtilisin mutant of Claim 3 wherein the 
amino acid residue in said subtilisin mutant at said 

15 position is serine. 

5. The subtilisin mutant according to Claim 2 which 
is derived from a Bacillus subtilisin. 

6. A mutant Bacillus subtilisin exhibiting improved 
proteolytic activity which is derived from naturally- 

20 occurring or mutant precursor Bacillus subtilisin which 
has the amino acid residue at a position equivalent to 
+123 in paging amvloliouefaciens subtilisin changed 
to serine. 

7 . A Bacillus subtilisin having serine at a position 
25 equivalent to +123 in gacjUys amvlolimi e f»rH* ng 

subtilisin. 
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8. A mutant Bacillus subtilisin having the amino 
acid sequence: 

AQSVPWGOSRVQAPAAHNRGLTGSG^VAVLDTGISTHPDLNIRGGASFVPGE 
PSTQDGNGHGTHVAGTIAALNNSIGVLGVAPSAELYAVKVLGASGSGSYSSIA 
5 QGLEWAGNNGMHVASLSLGSPSPSATLEQAVNSATSRGVLWAASGNSGAGSI 
SYPARYAKAMAVGATDQNNNRASFSQYGAGLDIVAPGVNVQSTYPGSTYASLN 
GTSMATPHVAGAAALVKQKNPSWSlWQIRiraLKNTATSLGSTNLYG 
AAAR. 

9. DNA encoding the carbonyl hydrolase mutant of 
10 claim 1. 

10. Expression vector encoding the DNA of Claim 9. 

11. Host cell transformed with the expression vector 
of Claim 10. 

12* An enzymatic cleaning composition capable of 
15 degrading proteins comprising: 

a) a surfactant end; and 

b) a carbonyl hydrolase mutant having an amino acid 
sequence not found in nature which is derived form a 
precursor carbonyl hydrolase by substituting a different 

20 amino acid for the amino acid residue at a position in 
said precursor equivalent to +123 or +274 in Bacillus 
amvloliouefaciens subtilisin. 

13 . The composition of Claim 12 wherein said carbonyl 
hydrolase enzyme comprises a subtilisin. 

25 14. The composition of Claim 13 wherein said 
subtilisin has the following amino acid sequence: 

AQSVPWGOSRVQAPAAHNRGLTGSGVRVAVLDTGISTHPDLHIRGGASFVPGE 
PSTQDGNGHGTHVAGTIAALNNSIGVLGVAPSAELYAVKVLGASGSGSYSSIA 
QGLEWAGNNGHHVASLSI/3SPSPSATLEQAVNSATSRGVLVVAASGNSGAGSI 
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SYPARYANAMAVGATDQNNNRASFSQYGAGLDIVAPGV1WQSTYPGSTYASLN 

GTSMATPHVAGAAALVKQKNPSWSNVQIRNHLKNTATSLGSTNLYGSGLVNAE 
AAAR. 

15. The composition of claim 12 wherein the 
5 surfactant comprises a detergent. 

16. A composition according to Claim 15 comprising a 
spray-dried detergent granule. 



WO 91/06637 



1/14 



PCT/US90/06084 



X 
£ 
d 

n 



3 
> 
£L 

t-H 
U 



O Q||- 

^ s: id 
' <r 

ID 



ID 

cog 
m g 

ID 

<r 

<i 
<t 
<i. 
<: 
<t 

ID 
fr- 
<i 
<X 
<Z 
CJ 
ID 
fr— 



0* 



in 



a 
o. 



j 

La 

a 

Q 

j 



LxJ 

a. 



5 to 
a) 

a: 



S 5 



o 
u 

Ui 



a. JL 



ID 
ID 



I- 
CJ. 
I- 
13 
fr- 
O 
I— 

<r 
<: 

ID 
<Z 

o 
<c 
o 
<x 

fr— 

<: 



1- u 
Qi C-> 
XA fr— 

c <r 

XCJ 

fr- <: 

LCJ 

to <: 

— ' ID 
ID ID 

01 O 



d ID 
CJ 
<t ID 

«P ID 
0f|- 

ac<: 

1. ID 

xo 
t— <x 

o Of fr- 

I Il-fr- 

0f CJ 
— I- 
•-• <t 

3 <X 
Of 



C5 ID 
U 
<ID 

3<t 



d . 

Q_ a. fr- 

3 ID 

-J o 

3 ID 
Of 



Qi ID 



<x 




c/j <: 


1- 






<t 


O 


Of o 


1— 


o 


— t- 


cj 


▼HI-* ^ 


<: 


1 




i- 




aiD 


<: 




L. ID 


o 






CJ 






i- 






fr- 




d fr- 


<z 




> ID 


fr- 






1- 






<. 




>%<x 


1- 






<: 






<t 




in <t 


<t 




>»<: 


<t 




—1 <x 


fr- 






o 




>>o 


<c 




— ID 


H- 




ID ID 


O 






1— 




a>«£ 


ID 




L ID 


ID 




<: <: 



-HID 

Af 1— 

x<: 


d<t' 

<: id 




CLfr- 

111 *T 

<. ID 


LID 

XO 


CXCJ 

ui <t 

<t ID 




3 <t 
ID ID 


Of ID 

co^ 


— 

> ID 




3«t 
— * 

ID ID 


O +»ID 
vD 0ft- 
1 31 <t 


> < 
fr- fr— 


o 

1 


— 1- 
dH 
> ID 


XO 


m <i 

> <: 




> ^ 


ClD 
IDO 


01 CJ 

XI- 

a. fr- 




c5 fr- 
— O 
<t ID 


lfl«t 

>*<E 

-j<: 


c<: 

ID O 




— O 
> ID 


£h 
Q-fr- 


in id 
> <r 




1- CJ 
Of ID 

CO <C 


>0D 
-^ID 
ID ID 


o c < 
^- w <x 
1 ID O 




O ID 
C CJ 
Q.O 


-*o 

• CJ»- 
>ID 


ID 
d fr- 
> ID 




CLO 

m <: 

^ID 


PRO 
Tvr lie 
TAT ATT 


VI <z 

>»<: 




>*<: 
_j <i 


>»ID 
ID ID 




in <c 
> < 


tn<x 


>»CJ 
ID ID 




3 ID 
Of 1- 
_J fr- 


CD IAID 

>»<: 

1 -»<£ 


ifi <r 

-J <£ 


cu <x 

1 ID ID 


3< 

ID ID 


3 <t 
ID ID 




U1 <t 


>ID 
(DID 


e fr- 

0l CJ 

c/> i- 




-> <L 
cS 1- 
> ID 


CO 
U|<£ 
<t<t 


0f I— 
— ' I— 

« <: 




d I- 
CJ 
<I ID 


l.<t 
Of CJ 

COI— 


CJ 
> ID 




IA <t 
— J <C 


w<t 
><t 
_J<r 


O Q.I- 

m io< 

1 <C ID 




3 <L 

— ' <: 

ID ID 


>ID 
-'ID 
ID ID 


-J <: 




C CJ 

in < 

< <L 


d<: 

— 'CJ 


in id 
>%<t 
^ <: 




3 <t 
Of fr- 

_J 1— 


dlD 
— ^ CJ 
<tlD 


IAID 

>*<t 
-J <i 




c <: 
x o 
fr- <t 


ClD 
ID CJ 


U 
<t ID 




CS CJ 

I- 

<t ID 



o cJCJ 
00 ^ o 
I <ID 

c 

Of o 



— ' CJ 
<t ID 

C CJ 
Of ID 

wo <: 
on 

OJ 



o C < 

n (u cj 
i to fr- 

d i- 

— CJ 
<t ID 



OJ 

n 



<C 

V 
13 



•UBfiTinrrp cucrr 



WO 91/06637 



2/14 



PCT/US90/06084 



<: 
i 



C <E 

— <r 

ID O 


— <E 
d h- 
> ID 


d CJ 
— CJ 

<r cd 


in <r 
> <r 


-P CD 
Q> h- 


d <r 
— o 

<1 CD 


> \— 

-* CD 
t n i n 


1. h- 
tit U 
CO h- 


W ID 
> <t 
»— i <; 


— h- 
d h- 
> ID 


<c 
d I— 


c h- 

in <c 


d h- 

LZJ 


O 1— 
C CJ 

0L o 


in o 

— <: 


3 <I 
0; h- 
i i— 


in cj 
— <x 

T f 1 


d 1- 

w CJ 

i n 


c u 


d u 
— ' o 

<r cd 


c o 
> <t 
1— 1— 


3 ID 
Ld h- 

_J CJ 


c h- 
in <t 

<L CU 


L I— 

X CJ 


C CJ 

> <c 


d <: 
— o 

<t ID 


in <l 
> < 


>u 

— CD 
CD CD 


d )— 

— O 

<L L—J 


^ 1_ O 

LL. LJ 


> < 
— CD 


0> Q>|- 
— I CJ 


Of O 

•— « <: 


O CLh- 

^ m <i 
— « <: cd 


CD 
d h- 

> CD 


O I— 
C O 


in i— 
— < 


in o 
— < 


d C «X 
— ' Ql CJ 
^4. r~" 


d CD 
— U 
^ CD 


d h- 
> CD 


X CJ 
h- <I 


d O 
O 


C H- 
<U CJ 
(/) j— 


<U CJ 


C d <C 

Ql w CJ 

V^i L-J 


aiD 

L CD 


d < 

o 

^ CD 


C CJ 
Ql CD 
CO <x 


in <r 
>*<c 
_j ^ 


tH 
Qj CJ 
CO |— 


c U 

in <c 
<n <; 


C O 
Ql CD 


3 CD 
— <£ 
in t n 


d CD 

— O 
i n 


c <t 
Ql O 
tO h~* 


Q> h- 


CLH- 

in < 

<t VD 


a c o 
in in < 
^ ^ <i 


o < 
c o 

n cj 


Qi CJ 
— 1- 


in <: 
> <t 


L O 
Ql CD 
CO <I 


o c <r 
^ — <r 

ID O 


Op CJ 

<; 


O Q.CJ 

\d in <: 

<T" in 


d CD 
— O 
<r i n 

^-L Li/ 


o >,< 
— • — ID 
CD CD 


3 <t 
Qi H- 
— J h- 


VD CD 
»-* CD CD 


L <C 
Qi CJ 


— * ID 
ID ID 


c < 

— < 

in r * 


— 1— 

d h- 

^ L_J 


c o 
in < 


d 1- 

— o 
<; id 


X Qi O 
H- CO h— 


— <c 

CJ H- 
> ID 


L CJ 
0j id 
to <r 


Ql O 


CD 

t n. i n 

L— 1 N— ^ 


Qi h- 
f <I 


d 

— ' o 
^ CD 


L Lh 
Q; X O 
CO »— <c 


>U 
— ' CD 
ID ID 


in <x 


C O 1— 
U) CO 

LL. v-J 


3 <r 

Qi 1- 


Ql CJ 
— h- 


JL h- 
Qi O 
CO 1— 


>sO 
— ' CD 
CD CD 


1- O 

5n< 


Of o 

— I— 

I— • «Q. 


O c H— 

Q. ^ <; 


— <t 
d 

> CD 


aiD 

C ID 


— CD 
CD CD 


3 <L 
CD CD 


O H- 
C CJ 
Ql o 


o — h- 

> ID 


i- <c 

X CJ 

*— <x 


o >*!- 
CO — ' CD 
. CD ID 


C O 
Ql ID 

to <r 


CO Ql CJ 
CO |— 


C o 

in <t 
<: <t 


— ID 
d 1- 

*^ L_J 


d ID 
— ' CJ 
<£. ID 


3 <C 
— <t 
CD CD 


Q» O 


c o 


o h- 
t- o 
0- o 


CD CD 


1_ LJ 
Ql CJ 
(✓) |~ 


— <r 

d h~ 
> ID 


C H~ 


c <: 

Ql CJ 


•5 ^ 

CD O 


>%<c 

— ' CD 
CD CD 


d O 
< CD 


c ID 

— ' <: 


in <t 

j <£ 


o 

C CJ 

n. CJ 


C O 

in <c 

^L VL 


— CD 
i n ■ n 

LZJ L_J 


>»o 

— ' CD 
CD CD 


cd 


^ d CD 

— O 
<t ID 


— H- 
d h- 
> ID 


H- 
d 1- 
> ID 


C 1— 

in <: 


c o 

Ql U 


3 U 
Ql h- 

«J CJ 


d CD 
— O 
^ CD 


~* C CJ 


C h- 

in <t 
< <, 


O +» CD 

in <u »— 


3 h- 


«-i n in 


C O 
Ql CD 
vO <L 


in di- 

^ > CD 


d id 
— u 

^ ID 


L ^ 
Ql CJ 
(/) I— 


c o 

Of CD 
00 <I 


d H- 
— ' o 

<T" in 


d au 
— m <t 

vL w 


CD 
Ql 1— 


— ' O 

d h* 

> CD 


- < 
X CJ 


>»<r 

-J ID 
ID ID 


d CJ 
— U 

<T in 


d CD 

— o 

*t i n 


in — u 

^ ^ CD 


C O 

in < 


— ' <t 
d H- 
> ID 


d < 
CJ 
^ ID 


C 1— 

X CJ 


>.<r 

tD 
in in 


— 

d H- 

i n 
^ CD 


— CD 
ID CD 


Qi h- 
— »— 
+-* <. 


— O 
d H- 

> CD 


— ' <C 
d h- 
cq id 


C CJ 


>sO 
— CD 

CD <: 


X O 
H- <X 


3 O 
Qi H— 
-J O 


— ' h- 
d H- 
> CD 


>>o 

— ID 
CD CD 


in U 
— <t 
X CJ 


O >,CJ 
CU — * CD 
ID ID 


d < 
<I CD 


o >,o 

1^* — ' ID 
CD CD 


— h— 
d h- 
> CD 


O CLO 

cu in <r 

^ <E CD 


L O 
Qi O 
CO h- 


ON 

on 

CO 




ON 

in 


OJ 
VD 


ON 
ON 
vD 




ON 
00 



\3 



WO 91/06637 



3/14 



PCT/US90/06084 



O \— 
i- LJ 
CL O 


> h- 
— ' LD 
ID ID 


X u 

H- <C 


in <z 


> <r 

— ' ID 
ID LD 


C CJ 

w <r 
<c <r 


c cj 
<: <r 


Or U 

<c 


— <C 
> ID 


L O 
> <t 
h- h~ 


x u 
»- <r 


3 LD 
Ol h- 
-J O 


i. o 

til \D 
CO <Z 


d ID 
U 
<t ID 


Q-ID 

L. ID 


> LD 
w is 

LD LD 


o c < 

0> QiU 
00 t- 


>stD 
_ id 
ID ID 


O c o 
^ VI < 

cu <t <c 


VI <t 

> <z 


Oi U 
CL h- 


i. O 


0 ID 

1 o 
a. cj 


> < 

LD LD 


(Li O 
00 H- 


VI <I 
> 

-J <x 


U) (J 

£S 




d <r 
— a 
<: id 


c CJ 

VI <I 
< < 


10 19 
> <t 
_J < 


>•< 
1- h- 


CX<£ 
C ID 
<C < 


> <t 

— ' ID 

id a 


1- H- 
01 U 
00 (- 


0i LD 
X H- 
£L *- 


ID U 


O O h- 
^ L O 

oj a. o 


3 h 
0i K- 
_J O 


o t h 
SO Ol O 
OJ 00 H- 


C CJ 
VI <t 

<: <: 


-J o 


Ol t— 

— ' h- 
•-4 < 


vk <r 

<t LD 


0; ID 

oo <: 


i. ID 
X CJ 
1- < 


3 ID 
— 1 h- 


>h 

LD 
LD LD 


1. o 
01 ID 
00 <t 


C CJ 
01 LD 
00 <c 


d h- 
— ' O 
< LD 


3 i_ 

Ol 

-J o 


Q.CJ 

(A <: 

<C ID 


C <X 
3 U 


d |- 

— U 
<C ID 


VI <t 

> <t 
-J < 


O — H 
CO dh- 
> ID 


Ol CJ 
— j— 

^ <t 


o tD 

n d cj 

OJ <E LD 


L < 

X CJ 

1- <t 


d I— 

w (J 

<t ID 


L t— 

Ol CJ 
00 1- 


>»< 
ID 
LD ID 


X U 
1- < 


> u 

-* ID 
ID ID 


— ' < 
d f- 
> ID 


d O 
U 
< ID 


t. o 
X o 
*— <r 


_J <C 
d h- 
> ID 


>%o 

— ' ID 
LD a 


K 
d h- 
> LD 


C CJ 

in <: 
<r <r 


d <t 
L> 
<C ID 


O H- 

C o 

CL O 


U) U 
- <t 
X (_> 


C 3 < 
LD LD LD 


— H- 
«~» < 


2 

€=> — <_> 

OJ < LD 


O LD 
L O 
Q. O 


O 3 < 
in 01 (— 
OJ _| h- 


U 
d h- 

> ID 


«H ID 
01 1- 
3E < 


L H- 

01 U 
CO H- 


L 1- 
01 LD 

oo <r 


L. 

01 U 
00 1- 


— O 

>G 


d <X 
O 
< LD 


1. o 

0l LD 

00 <z 


Oh 

c u 

£u 


ai- 

vi <: 

<t ID 


•P LD 
Of h- 
X < 


o o 

L. LD 

<r o 


i. u 

> -a 
i- 


3 *- 
0* H- 
-J o 


e <i 

0l o 
00 r- 


— u 
d K 

> LD 


> « 


3 ID 
— < 
ID LD 


o L LD 
OJ X u 
OJ 1— <c 


C <t 
ID O 


CVJ 


ON 

ON 


o 


ON 



u 

CJ 
I— 
CJ 
LD 
O 
CJ 

< 

S 



LD 

r- 

<. 

CJ 
LD 
CJ 
CJ 

U 
CJ 



CJ 



LD 
CJ 
CJ 
LD 
CJ 

^- CJ 

a: <-> 

Ldg 



CJ 
CJ 
LD 
LD 
CJ 
CJ 
<Z 

<z 
<t 
<t 
<c 
<t 

<c 

CJ 



in C LD 

— <c 

OJ LD CJ 

d H- 

— CJ 
<C LD 

d <: 

CJ 
<X ID 

d ID 

CJ 

<r ld 
c <r 

LD CJ 

N dh 
OJ > LD 



OJ 
0J 



LD 
CJ 
CJ 
ID 
CJ 
h- 
<L 
<L 
O 
H- 
CJ 
H- 
LD 
O 
<C 
<t 

<r 

LD 
r- 
CJ 
CJ 
r- 
CJ 

<: 
o 

CJ 
LD 
LD 
CJ 
<L 
<C 
I- 
ID 
CJ 
CJ 
CJ 

I— 

ID 
<£ 
O 

13 

ID 
LD 
CJ 
U 
CJ 

<c 

LD 



LD 
ID 
LD 
O 
LD 
ID 
CJ 
^ 
<Z 
<C 
LD 

<: 

ID 
CJ 

<t 
<Z 



LD 



a 

CJ 
CJ 

I- 
a 

LD 
LD 
■h- 
<£ 
ID 
ID 
CJ 
<£ 
LD 
CJ 
I— 
<t 
<t 

vD 



LD 
LD 
CJ 

< 

r- 
ID 
CJ 



< 
CJ 
ID 
LD 
CJ 
<C 
LD 
<t 
LD 
ID 
ID 
CJ 
CJ 
<. 

^ 
LD 
r- 
CJ 
CJ 



ID 
CJ 
LD 
ID 
CJ 
LD 
LD 
CJ 
I- 
LD 
ID 
CJ 
< 

J— 
< 
CJ 
CJ 
ID 
I— 
<. 

5 

h- 
CJ 
LD 
<t 
CJ 
h- 
LD 
LD 
O 
CJ 



LD 
ID 
CJ 
CJ 
CJ 



CJ 

vD 
«— i 



y 



■IJRftTlTir-rc cun 



WO 91/06637 4/u PCT/US90/06084 

TDTALLY CONSERVED RESIDUES IN SUBTILISINS 

1 10 20 

. . . . P 

21 30 40 

• . G . i i i . , , tDiG ( i t • H • 

41 50 60 

• » G i • # i V • i 



• i i 



t i • 



* t i 



261 270 
N 



(■iiit* 



61 70 80 
HGTH..6 



81 90 100 

• •Gi..i,, lltl( VL..»G 

101 • 110 120 
SG G 

121 130 140 

L G 

141 150 160 

G GN 

161 170 180 
VP V. 



(■it 



• ■ i • i • 



FIG. -2 



■ i 



181 190 200 

• i • i S F S t 



201 210 220 

P G G T 

221 230 240 

SMA.PHVAG. 



i i 



241 250 260 



WO 91/06637 



5/14 



PCT/US90/06084 



Homology of Bacllur pro-teases 

1. Bacillus any lol I qui fact ens 

2. Bacillus sub-tills var.1168 

3. Bacillus I l chen If orn is (car Isbergensls) 



1 

1 
















10 


















20 




s 


v 


p 


Y 


G 


V 


S 


Q 1 


K 


A 


p 


A 


L 


H 


S 


Q 


G 


ft 0 


s 


v 


p 


Y 


G 


I 


s 


Q 1 


K 


A 


p 


A 


L 


H 


s 


Q 


G 


ft Q 


T 


V 


P 


Y 


G 


I 


P 


L 1 


K 


A 


r 


K 


V 


Q 


A 


Q 


G 


21 
















30 


















40 


Y T 


G 


s 


N 


V 


K 


V 


A 


V 1 


D 


S 


G 


1 


D 


S 


S 


H 


P 


Y T 


G 


s 


N 


V 


K 


V 


A 


V 1 


D 


S 


G 


1 


D 


S 


S 


H 


P 


F K 


G 


A 


N 


V 


K 


V 


A 


V L 


D 


T 


G 


1 


Q 


A 


S 


H 


P 


41 
















50 


















60 


D L 


K 


V 


A 


G 


G 


A 


S 


M V 


P 


S 


E 


T 


N 


P 


F 


Q 


D 


B L 


N 


V . 


R 


G 


G 


A 


s 


F V 


P 


S 


E 


T 


N 


P 


Y 


Q 


I) 


D L 


N 


V 


V 


G- 


G 


A 


s 


F V 


A 


G 


E 


A 


Y 


N 


T 


i 


D 


61 
















70 


















80 


N N 


S 


H 


- G 


T 


H 


V 


A 


G T 


V 


A 


A 


L 


N 


N 


S 


1 


G 


D L 


N 


V 


R 


G 


G 


A 


s 


F V 


P 


S 


E 


T 


N 


P 


Y 


Q 


D 


G N 


G 


H 


G 


T 


H 


V 


A 


G T 


V 


A 


A 


L 


D 


N 


T 


T 


G 


81 
















90 


















100 


V L 


G 


V 


A 


P 


S 


A 


S 


L Y 


A 


V 


K 


V 


L 


G 


A 


D 


G 


V L 


G 


V 


S 


P 


S 


A 


S 


L Y 


A 


V 


K 


V 


L 


I) 


S 


T 


G 


V L 


G 


V 


A 


P 


S 


V 


S 


L Y 


A 


V 


K 


V 


L 


N 


S 


S 


G 


101 
















110 


















120 


S G 


Q 


Y 


S 


V 


1 


1 


N 


G 1 


E 


w 


A 


I 


A 


N 


N 


M 


D 


S G 


Q 


Y 


S 


V 


1 


1 


N 


G 1 


E 


w 


A 


I 


S 


N 


N 


M 


D 


S G 


S 


Y 


s 


G 


1 


V 


S 


G 1 


E 


V 


A 


T 


T 


N 


G 


M 


D 



FIG.-3A 
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